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ABSTRACT 


In  this  paper  a  discussion  and  re-analysis  of  experimental 
data  is  carried  out.  The  data  consist  of  measurements  of  the 
performance  on  two  tasks  (verbal  and  spatial)  that  are  performed 
by  the  two  different  brain  halves.  The  original  experiments 
and  analysis  had  been  conducted  by  R.  Klein  and  R.  Armitage 
in  the  investigation  of  ultradian  rhythms  in  humans.  The 
period  of  the  ultradian  cycle  had  been  hypothetically  equated 
to  the  period  of  REM-sleep  occurrence.  In  the  re-analysis, 
previously  unmentioned  inhomogeneity  was  discovered  and  the 
findings  by  the  original  experimenter-analysts  could  not  be 
confirmed  at  the  stated  significance  levels.  A  discussion  of 
and  recommendations  for  the  experimental  set-up  are  included. 

The  re-analysis  was  conducted  with  non-parametric  methods. 

Small  ultradian  rhythmicity  was  concluded  to  have  been  induced 
by  influence  of  circadian  rhythm,  experimental  set-up  and 
inhomogeneity  among  subjects. 
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I.  DISCUSSION 


A 


A.  INTRODUCTION 

An  awareness  and  interest  in  various  cyclic  phenomena  is 
not  a  recent  development.  Man,  since  his  earliest  existence, 
has  been  aware  of  cyclical  influence  in  his  environment:  day 
changed  to  night,  only  to  repeat  itself  about  12  hours  later; 
spring,  summer,  fall  and  winter  followed  each  other  continuously; 
the  recurrent  changing  of  the  moon,  ebb  and  flow  of  tides  and 
the  female  menstrual  cycle  were  obvious.  The  Greeks,  however, 
were  perhaps  the  first  to  actually  apply  a  knowledge  of  cycles 
in  an  attempt  to  better  understand  their  environment.  Luce 
[1970]  suggests  that  the  Greeks  applied  their  knowledge  of 
cycles  in  the  treatment  of  certain  illnesses  over  2000  years 
ago . 

This  knowledge  and  interest  was  obscured  during  the  period 
known  as  the  Dark  Ages.  At  the  end  of  the  19th  century  a 
theory  of  biological  cycles  known  as  biorhythm  rekindled  the 
interest  and  stimulated  research  in  the  area.  During  the  20th 
century  the  interest  has  intensified  with  more  cycles  being 
identified  and  serious  research  activity  devoted  in  an  attempt 
to  understand  the  nature  of  the  various  biological  cycles. 
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B.  DEFINITION  AND  EXPLANATION  OF  CYCLES 

A  cycle  is  a  course  or  series  of  events  that  recur  regularly 
and  usually  lead  back  to  the  starting  point.  [Webster,  1976] 
Mathematically,  a  function  is  cyclic  with  period  T  if 
X(t+kT)=x(t)  for  k=0 , -1 , -2 , .  .  . 

A  rhythm  is  a  regularly  recurrant  quantitative  change  in  a 
biological  process.  [Webster,  1976] 

A  rhythm  -  as  defined  above  -  is  produced  by  oscillators; 
these  can  be  either  linear  or  non-linear  oscillators,  thus 
causing  linear  (e.g.  sinusoidal)  or  non-linear  (e.g.  relaxation, 
"saw  tooth")  oscillations. 

Just  as  the  cycles  observed  in  man's  environment  since 
ancient  times  occur  in  different  frequencies,  the  periods 
(1  over  frequency)  of  rhythmic  activities  in  biological 
systems  vary  from  seconds  or  minutes  (e.g.  in  cell  growth 
and  division)  to  hours  (e.g.  the  24-hour  cycle  of  the  circadian 
rhythm)  to  days  or  weeks  (e.g.  the  28-day  clcle  of  female 
menstruation) . 

The  terminology  of  cycle  research  is  grouped  around  the 
most  obvious  one,  the  cycle  with  a  period  of  "about  a  day" 

(about  24  hours) .  Dr  Franz  Halberg  coined  the  term  "circadian 
rhythm"  for  this  from  Latin  "circa  -  about"  and  "dies  -  a  day." 
Cycles  with  periods  shorter  than  a  day  are  called  ultradian, 
those  with  periods  longer  than  a  day  are  refereed  to  as 
infradian  cycles. 
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There  are  essentially  two  different  possible  explanations 
of  rhythmic  activities  in  biological  systems.  The  first  is 
that  it  is  an  essential  dynamical  feature  of  the  process 
observed,  i.e.  it  is  part  of  the  process  and  occurs  within 
it  at  a  certain  place  regardless  of  when  and  where  the  whole 
process  takes  place.  The  second  is  that  such  rhythms  represent 
adaptive  responses  of  the  organism  to  a  periodic  environment; 
i.e.  if  the  organism  were  not  or  had  not  been  exposed  to  the 
periodicity  (i.e.  entrained  by  zeitgeber,  a  periodic  clue¬ 
giving  environmental  feature) ,  it  would  not  show  the  rhythm. 
[Oatley  and  Goodwin,  1971] 

An  example  for  the  first  possible  explanation  -  a  dynamical 
feature  regardless  of  the  environment  -  is  found  in  cell  growth 
and  division.  If  the  cell  division  occured  when  triggered  by 
the  environment,  it  could  easily  happen  that  the  resulting 
daughter  cells  would  not  receive  a  full  set  of  the  hereditary 
material  (DNA) ,  would  then  be  deficient  in  genes  and  would  not 
be  able  to  survive.  Considering  the  enormous  number  of  cell 
divisions  it  takes  to  produce  a  new  life,  the  probability  of 
a  healthy  being  to  result  from  a  cell  division  process  that 
was  environmentally  induced  is  very  small. 

The  second  possible  explanation  -  entrainment  by  zeitgeber  - 
appears  to  account  for  all  those  cases  that  seem  to  have 
originated  as  a  result  of  adaptation  to  a  periodic  environment. 
The  most  often  discussed  and  rather  obvious  zeitgeber  is  the 


24-hour  day,  held  responsible  for  entrainment  of  the  circa¬ 
dian  rhythm;  another  is  the  12-hour  tidal  cycle.  Its 
entrainment  has  been  studied  in  marine  (littoral)  organisms. 

A  most  difficult  to  explain  phenomenon  occurs  when  the 
frequency  of  one  internally  caused  cycle  falls  into  the  range 
of  an  environmentally  induced  cycle.  The  resulting  complicated 
periodic  organization  then  has  to  be  taken  as  partly  adaptive 
and  partly  of  internal  orgin.  [Oatley  and  Goodwin,  1971]  Bruce 
[1965]  demonstrated  that  this  is  often  the  case  for  cell 
division  of  multicellular  organisms.  Generation  times  were 
found  to  converge  in  the  neighborhood  of  24  hours  or  a 
multiple  thereof.  Interaction  of  this  kind  may  also  occur 
when  there  are  simple  multiple  relations  between  frequencies 
like  the  24-hour  cycle  and  the  28-day  cycle. 

C.  CIRCADIAN  AND  ULTRADIAN  RHYTHM  RESEARCH  DURING  SLEEP 

While  psychological  and  physiological  phenomena  of  the 
circadian  as  well  as  of  the  infradian  rhythm  have  been 
researched  extensively  [Kleitmann,  1949;  Harker,  1958; 

Aschoff,  1963;  Lobban,  1965;  Colquhoun,  1971],  ultradian 
rhythms  have  only  been  studied  during  sleep-phases  or  in 
relation  to  sleep.  Ultradian  rhythms  are  those  with  a  period 
shorter  than  24  hours,  a  frequency  thereby  of  more  than  one 
per  day.  With  the  use  of  electroencephalograph  (EEG)  that 
records  voltages  of  the  biological  system  through  sensors 
connected  to  various  areas  of  the  skull,  the  depth  of  sleep  - 
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among  other  things  -  can  be  measured.  Not  only  have  different 
levels  of  sleep  been  identified  but  it  has  also  been  found 
that  one  of  the  levels  recurs  in  a  cycle  of  about  90  minutes. 
This  level  or  stage  is  the  rapid  eye  movement  stage  (REM-stage) 
named  after  its  physiological  phenomenon.  Another  name  for  it 
is  dream-sleep-stage,  since  it  is  during  this  stage  that  a 
person  dreams.  The  REM  cycle  is  the  best  researched  of  all 
ultradian  cycles  in  animal  and  human  observation.  Hartmann 
[1967]  quotes  the  average  lengths  in  minutes  as  follows: 
mouse  (3-4),  rat  (7-13),  rabbit  (24),  opossum  (17),  cat  (20-40) 
monkey  (40-60),  man  (80-90),  elephant (120) .  Kleitmann  [1967] 
has  attributed  it  to  the  change  in  metabolic  rate.  Be  this 
as  it  may,  the  REM  cycle  during  sleep  is  an  excellent  example 
of  an  ultradian  rhythm. 

D.  ULTRADIAN  RHYTHM  RESEARCH  DURING  WAKEFULNESS 

These  findings  accepted,  the  logical  question  to  ask  was 
whether  a  continuation  of  the  REM  cycle  during  the  wakeful 
state  could  be  hypothesized  and  observed.  Kleitmann  [1963] 
suggested  that  the  basic  rest-activity  cycle  of  80-90  minutes, 
as  seen  in  the  recurrant  stages  associated  with  dreaming  in 
actual  sleep,  may  persist  during  the  waking  period  also, 
manifesting  itself  in  recurrant  fluctuations  of  alertness. 
[Colquhoun,  1971] 

From  there  on  it  was  only  a  question  of  time  until 
scientists  would  look  for  parallels  between  the  -  by  now  well 


researched  -  field  of  sleep  and  that  of  wakefulness.  In  1922 
a  group  of  subjects  had  been  kept  in  bed  all  day  and  a  roughly 
90-minute  rhythm  of  body  movement  and  stomach  contraction  had 
been  observed.  [Luce,  1970]  Oral  activity  (eating,  drinking, 
smoking)  was  found  to  increase  and  decrease  with  a  range  of 
85-110  minutes  with  a  mean  of  96  minutes.  [Friedman,  1965] 

E.  ASSOCIATED  RESEARCH  PHENOMENA 

Scientific  biological  experimentation  has  always  focused 
on  animals.  Hundreds  of  studies  have  been  conducted  observing 
anything  from  fruit  flies  to  mice  to  monkeys.  While  the 
relative  availability,  short  life  cycles  and  possibility  of 
complete  destruction  made  observation  and  evaluation  in 
animals  easier  than  in  human  experiments,  it  also  seems  to 
have  led  astray  many  scientists  and  their  audiences  into  the 
belief  that  primitive  creatures  certainly  followed  certain 
instincts  and  reflexes  in  leading  their  somewhat  artificial 
lives.  Many  humans,  however,  would  not  readily  accept  that 
man  himself,  considered  the  crown  of  creation,  still  followed 
old  innate  or  environmentally  entrained  oscillators  that 
contribute  to  many  of  his  emotions,  actions,  reactions  and 
physiological  phenomena. 

It  is  also  true  that  biological  research  in  the  field  of 
cycles  and  rhythms  is  most  difficult  to  perform  on  humans.  As 
man  subjects  himself  to  externally  imposed  schedules  of  work, 
sleep,  entertainment  and  food  intake,  the  biologically  "normal" 
cycle  is  often  disturbed  or  willfully  counteracted  (e.g.  by 
night  work) .  1  4 
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F.  IMPACT  AND  IMPORTANCE  OF  RESEARCH 

The  importance  of  research  in  the  field  of  rhythmic  changes 
in  human  performance  becomes  obvious  once  one  realizes  how 
many  actions,  reactions  and  their  effects  or  evaluations  depend 
on  human  reaction  time,  accuracy,  acuity,  steadiness,  etc., 
all  potentially  subject  to  variation  via  internal  "biological 
clocks."  [Kleitmann,  1949;  Harker ,  1958;  Aschoff,  1963]  While 
the  circadian  rhythm  is  generally  accepted  and  -  sometimes  - 
considered  in  the  scheduling  of  human  activities  (e.g.  night 
shifts,  watch  bills),  an  ultradian  rhythm  has  so  far  not  been 
shown  to  occur  during  man ' s  wakefulness  and  has  therefore  not 
been  considered. 

If  it  exists  and  its  existence  makes  a  significant  difference 
in  many  or  all  of  the  above  mentioned  human  activities  and  if 
it  is  predictable,  the  impact  of  its  discovery  could  be 
extensive .  Long  term  employment  and  work  output  may  not  be 
affected  since  in  the  long  run  (e.g.  a  day)  the  ups  and  downs 
would  smoothe  out  to  an  average  performance  level  that  would 
follow  the  circadian  rhythm.  Any  short  term  employment  and 
output,  however,  could  be  affected  significantly.  School  exams 
of  one  hour  duration  could  fall  either  in  the  improved  perform¬ 
ance  period  or  the  degraded  performance  period  of  such  a  cycle; 
reaction  to  warning  signs  or  signals  could  be  impaired  if  the 
signal  presentation  occurred  during  the  "down  period." 

The  most  significant  immediate  impact  would  probably  be 
felt  in  the  scientific  community  where  tests  of  short  duration 


15 


% 


are  very  common.  Medicine  and  medical  research  might  have  to 
revise  some  of  their  findings.  For  example,  human  reaction 
to  drugs  might  be  time-dependent  or  the  effect  of  drugs  could 
be  over-shadowed  by  the  effect  of  the  ultradian  rhythm.  A 
drug  that  had  been  thought  of  as  improving  reaction  time  might 
be  ineffective  and  any  faster  reaction  time  might  have  been 
due  to  an  upswing  in  the  ultradian  rhythm.  The  effect  of  a 
drug  that  had  been  found  ineffective  might  not  have  demonstrated 
desired  results  because  a  subject  was  on  a  downswing  in  the 
ultradian  rhythm.  As  for  the  above  mentioned  drug  experiments 
many  subjects  are  used,  it  is  not  very  likely  that  the 
discovery  or  acceptance  of  an  ultradian  rhythm  would  overthrow 
all  findings,  but  it  should  be  considered  in  the  planning  of 
future  experiments,  especially  those  that  involve  only  few 
subjects  and  are  of  short  duration. 

An  important  question  to  ask  before  the  above  contemplations 
are  considered,  is  whether  performance  is  related  to  the  ultra¬ 
dian  rhythm  -  should  it  exist.  Hunger  (stomach  contractions, 
sucking  movements  of  babies) ,  sex  (penile  erection  during  sleep 
or  very  tired  wakefulness)  and  survival  (REM  sleep  "alertness") 
are  all  neurologically  based  in  the  oldest  part  of  the  brain. 

Does  performance  ever  draw  from  those  parts  or  should  it  be 
influenced  by  them? 
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G.  KLEIN’S  AND  ARMITAGE ' S  [1976]  EXPERIMENTAL  SET-UP 

Based  on  the  previously  mentioned  suggestions  that  a  90- 
minute  cycle  observed  during  sleep  might  continue  in  some  form 
during  the  wake  state,  Klein  and  Armitage  [1976]  of  the 
Department  of  Psychology  at  the  Dalhousie  University,  Halifax, 
Canada,  designed  an  experiment  to  investigate  the  hypothesis 
that  the  above  suggestion  was  true.  In  addition  to  this 
hypothesis  they  also  investigated  previous  findings  by  other 
scientists  [Cohen,  Gross;  Geffen,  et  al.  1972]  which  indicated 
that  the  two  brain  halves  -  the  right  and  left  hemisphere  - 
process  different  kinds  of  information.  According  to  those 
findings  the  right  hemisphere  specialized  in  processing  visual- 
spatial,  wholistic,  non-logical  information,  while  the  left 
half  specialized  in  verbal-linquistic ,  analytic  and  logical 
processes.  Combining  these  two  areas  of  study  the  hypothesis 
under  test  was  that  performance  measurements  on  each  of  two 
tasks  -  to  be  described  below  -  should  reveal  rhythmic 
oscillations  with  a  period  of  approximately  80-120  minutes  and 
that  equivalent  phases  of  the  two  rhythms  should  be  separated 
by  180°  (one  half  cycle) . 

The  experimenters  chose  two  simultaneous  matching  tasks. 

For  the  spatial  task  subjects  were  required  to  match  dot 
patterns  of  seven  dots  each,  the  verbal  task  required  them 
to  match  letters.  Two  dot  patterns  had  to  be  judged  same  or 
different  and  a  capital  letter  was  compared  to  a  lower  case 
letter  and  judged  same  or  different.  The  actual  recording  of 
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the  decision  was  made  by  the  subject  by  crossing  out  S's  or 
D's  for  same  or  different.  The  subjects  were  eight  young 
adult  volunteers  in  the  age  range  of  18  to  24;  there  were 
five  females  and  three  males  who  were  right-handed  and  had 
no  left-handed  siblings  or  parents. 

All  eight  subjects  were  tested  in  a  group  testing  room, 
all  on  the  same  day.  They  had  two  booklets  in  front  of  them 
on  a  desk,  one  for  each  task.  The  booklets  contained  48  pages 
with  96  problems  to  a  page.  (The  booklets  were  to  be  used  over 
again  after  completion.  An  influence  of  the  outcome  by  using 
the  same  problems  over  again  was  not  to  be  expected  since  over 
4000  problems  would  have  to  be  solved  before  they  repeated 
themselves) . 

The  subjects  were  told  to  record  their  same/different  choices 
as  quickly  as  possible  without  making  errors.  They  were  given 
practice  on  each  task  to  account  for  a  possible  learning 
curve  (LC) . 

Performance  was  to  be  measured  on  each  task  every  15  minutes 
for  three  minutes.  The  test  started  at  0900  and  ran  for  8  hours, 
yielding  32  data  points  (observations)  per  task  per  subject. 

Lunch  was  from  1230  to  1300.  The  testing  was  not  interrupted, 
but  subjects  ate  in  the  free  minutes  during  this  period. 

Subjects  were  not  allowed  to  talk  to  one  another  but  could  walk 
around,  draw  or  read.  They  also  reported  in  writing  their 
thoughts  at  the  end  of  every  15-minute  interval.  Dependent 
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variable  for  the  test  was  number  correctly  matched  pairs  in 
each  3-minute  test  period.  This  was  accepted  to  reflect  the 
subjects’  rate  of  accurate  performance  on  the  matching  tasks, 
since  the  error  rate  was  low  (1.6%). 

H.  CONSIDERATIONS  ABOUT  AN  ADVISED  EXPERIMENTAL  SET-UP 

Before  listing  some  general  thoughts  on  experimental  set-up 

for  the  kind  of  experiment  as  the  one  described  in  this  thesis, 
it  should  be  emphasized  that  by  mentioning  a  consideration  or 
recommending  a  procedure  it  is  in  no  way  implied  that  Klein 
and  Armitage  did  not  consider  such  question  in  the  conduct  of 
their  effort.  A  few  of  the  facts  from  their  description  of 
the  experiment  are  contrary  to  recommendations  that  seem  to 
be  justified  from  the  study  of  related  research  phenomena. 

Since  it  is  impossible  to  describe  an  experimental  set-up 
within  a  journal  article  of  reasonable  length  and  mention  all 
reasoning  behind  each  element  of  the  set-up,  any  of  the 
following  recommendations  about  the  conduct  of  such  an  experi¬ 
ment  that  are  not  in  direct  contrast  to  the  way  it  actually 
was  done  should  be  understood  more  as  an  enlargement  of  the 
background  knowledge  than  as  an  insinuation  that  Klein  and 
Armitage  might  have  neglected  to  consider  them. 

I.  DISCUSSION  OF  EXPERIMENTAL  SET-UP 

To  set  up  an  experiment  with  expected  outcomes  of  any 
significance  several  principles  need  to  be  observed: 
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The  number  of  test  subjects  needs  to  be  large  enough 
to  rule  out  findings  by  chance  or  idiosyncracies  of 
the  subjects  chosen.  For  tests  involving  humans  as 
subjects  numbers  have  been  as  small  as  two  in  many 
experiments.  This  is  very  close  to  the  borderline  of 
unacceptability.  For  an  experiment  for  which  it  is 
anticipated  that  the  findings  might  be  generalized  to 
all  humans,  a  representative  mix  of  subjects  has  to 
be  selected.  Men,  women,  young,  old, well  educated 
and  undereducated  are  some  immediate  distinctions. 

The  last  two  categorizations  might  be  especially 
critical  when  the  experiment  is  a  test  and  the 
variable  tested  is  performance. 

The  test  subjects  are  not  to  be  informed  of  expected 
outcomes,  but  need  to  know  in  general  how  the  test/ 
experiment  is  to  be  conducted,  without  knowledge  of 
procedures  and  measurements,  human  curiosity,  anxiety 
or  even  fear  could  seriously  contaminate  test  results . 
The  interaction  between  personnel  administering  the 
test  elements  and  the  Ss  needs  to  be  formalized  and 
strictly  standardized  without  the  least  chance  for  the 
Ss  to  "read  between  the  lines"  of  any  comments. 
Otherwise,  the  Ss  will  tend  to  "deliver"  the  desired 
result  to  the  experimenter.  [Rosenthal-ef feet ;  Russell, 
1977] 


4.  Humans  tend  to  act  differently  when  they  feel  they 
are  being  ovserved  than  when  they  think  they  are 
unobserved.  The  increase  in  activity  or  productivity 
of  working  people  due  only  to  the  fact  that  they  are 
being  "paid  attention  to"  is  known  as  the  Hawthorne- 
effect.  In  performance-observation  experiments  this 
effect  must  not  be  neglected. 

5.  Constant  presence  and  neutral  behavior  of  the  administer¬ 
ing  personnel  are  of  paramount  importance,  if  the 
experiment  measures  possible  performance  variation. 

If  interspersed  absence  of  the  administering  personnel 
is  necessary,  the  times  and  lengths  of  absence  need 
to  be  randomized  and  noted  in  relation  to  experimental 
outcomes . 

6.  Any  regularity  during  an  experiment  directed  to 
investigate  rhythmic  phenomena  has  to  be  avoided  in 
order  not  to  induce  an  artifical  rhythm  or  cycle  or 
a  harmonic  of  it. 

7.  Physical  separation  of  subjects  would  be  helpful  to 
ensure  absolute  non-interference  between  them.  In 
tests  like  the  one  studied,  where  booklets  have  to 
be  moved  and  pages  are  turned  or  moved,  the  simple 
sould  of  one  S  turning  a  page  may  be  enough  to  evoke 
a  "competitive  spirit"  in  another.  [Blake,  1970] 
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B.  Immediately  connected  to  avoidance  of  regularity  and 


to  separation  of  Ss,  the  next  aspect  of  good  experimental 
set-up  would  be  temporal  separation  of  the  subjects.  Not 
only  would  possible  interaction  between  them  be  eliminated, 
but  also  the  circadian-rhythm-and  external  everyday- 
life-cycle-induced  regularities  would  be  kept  from 
contaminating  the  test  results. 

9.  A  different  approach  to  the  randomization  of  the  times 
of  test  would  be  to  use  a  smaller  number  of  Ss  - 
thereby  increasing  the  homogeneity  of  the  Ss  -  and 
have  them  perform  on  very  similar  tasks  elements  over 
several  8-hour-time  blocks  -  beginning  at  different 
times  -  for  several  days  to  obtain  at  least  the  same 
amount  of  data  points . 

10.  The  biggest  interference  with  any  test  of  a  length  of 
time  greater  than  time  between  meals,  of  course  is  a 
meal.  Among  the  meals  again  it  is  lunch  that  seems  to 
have  the  greatest  effect  on  the  human  daily  routine. 

While  lower  animals  can  be  turned  around  completely 
and  their  biological  clocks  readjusted  [Lobban,  1965], 
this  is  not  easily  done  in  humans.  A  contributing 
factor  here  are  the  social  constraints  that  mankind 
has  subjected  itself  to  and  that  are  difficult  to 
alter.  Lunch  has  become  a  social  function  during 
which  ideas  are  discussed,  business  deals  are  talked 
over,  friends  are  met,  the  workplace  is  left  and  in 
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many  cases  the  family  or  home  is  visited.  EsDecially 
in  the  last  two  aspects  the  lunch  break  or  lunch  time 
has  become  something  to  look  forward  to,  to  work  up 
to,  to  condition  one's  mind  towards.  It  cuts  the 
working  day  in  half,  is  something  desirable  and  a 
great  motivator.  The  administrator  of  an  experiment 
that  extends  through  the  normal  lunch  period  has  to 
be  aware  of  this  conditon  of  the  human. 

Besides  the  psychological  phenomenon  of  looking  forward  to 
that  time  of  day,  a  physiological  phenomenon  exists.  By  about 
lunch  time,  breakfast,  the  earlier  meal  of  the  daily  eating 
routine,  has  been  digested  and  the  body  is  running  out  of 
nutrients  to  keep  working  at  the  same  pace  as  it  has  been 
during  morning;  the  blood  sugar  level,  for  example,  reaches  a 
low  level.  [Haggard  and  Greenberg,  1935]  The  body  is  ready  for 
a  refill.  It  reminds  the  person  of  this  fact  by  a  feeling  of 
hunger.  Following  this  feeling,  the  person  eats  and  experiences 
that  the  blood  -eager  to  assimilate  the  nutrients  from  the 
food  -  fills  the  capillary  system  of  the  intestinal  digestive 
tract  rather  than  that  of  the  brain.  A  decrease  in  the  amount 
of  blood  in  the  brain  area  sets  in  and  results  in  tiredness. 
Instead  of  following  this  feeling,  however,  and  taking  a  siesta 
or  a  long  noon  break  as  is  done  in  southern  countries  -  the 
average  person  in  industrialized  countries  continues  to  work 
following  lunch.  With  reduced  efficiency  and  reduced  presence 
of  mind  and  responses,  the  probability  of  errors,  misjudgements, 
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erratic  and  useless  movements  increases,  often  resulting  in 
accidents,  injury  or  death;  always  resulting  in  reduced 
performance.  As  for  the  experimental  set-up,  the  evaluation 
of  various  studies  concerning  effects  of  food  intake  on 
physical  efficiency  and  mental  acuity  needs  to  be  taken  into 
acocunt.  [Bjerner,  Holms Swensson ,  1955;  Haggard  &  Greenberg, 
1935]  In  a  long-term  study  (19  years) ,  reading  errors  of 
meter  readers  (in  Swedish  gas  works)  were  recorded  (Figure  1) ; 
"reading  errors  were  most  frequent  shortly  after  the  midday 
break,  during  meal  digestion."  [Grandjean,  1970]  Measurements 
of  blood  sugar  level  and  respiratory  quotient  (Figure  2)  during 
experimentally  varied  timings  of  meals  indicated  that  low 
levels  of  these  two  indirect  measures  of  efficiency  can  be 
avoided  by  increasing  the  number  of  meals  to  five  and  spacing 
them  less  far  apart.  [Haggard  &  Greenberg,  1935]  [Figure  3] 
These  findings  have  been  confirmed  by  other  experimetns. 

Not  for  performance  reasons  initially,  but  for  health 
reasons,  dieticians  have  recommended  the  same  timing  of  meals 
to  relieve  the  body  from  the  extremes  of  fullness  and  emptiness 
and  have  its  metabolism  instead  perform  at  a  rate  that  is  more 
moderate  and  closer  to  constant. 

The  administering  of  in-between-meals  during  an  experiment 
researching  cyclical  phenomena  is  therefore  advisable  since  it 
will  smoothe  out  physiologically  induced  variations  of 
performance . 


Any  degradation  of  performance  around  the  lunch  time  of 


the  subjects  -  as  observed  in  the  data  under  discussion  - 
should  therefore  not  be  attributed  to  an  ultradian  rhythm 
without  keeping  in  mind  that  it  is  also  the  expected  occurence 
following  the  circadian  rhythm  of  the  average  subject. 


FIGURE  1.  READING  ERRORS  OF  GAS  METER  INSPECTORS,  COMPILED 
OVER  19  YEARS.  READING  ERRORS  REACH  A  MAXIMUM  IN 
THE  EARLY  AFTERNOON  (DURING  THE  DIGESTIVE  PERIOD) 
AND  DURING  THE  NIGHT  WORK  IN  THE  EARLY  HOURS  OF 
THE  MORNING.  [Bjerner,  Holm,  Swensson] 


11.  A  further  point  of  consideration  in  the  design  of  an 

experiment  during  which  measurements  are  taken 
repeatedly  is  the  possibility  of  the  subject's 

improvement  due  to  a  learning  curve  (LC) .  In  the 

experiment  under  discussion  the  test  problems  were 

arranged  in  booklets  of  48  pages  each  with  96  problems 

to  a  page.  A  subject,  therefore,  had  to  complete  well 


25 


quotient  respiratory  quotient 


Number  of  hours  after  meal 


FIGURE  2. 


EFFECTS  OF  TWO  MEALS  (ABOVE)  AND  OF  FIVE  MEALS 
(BELOW) .  THE  RESPIRATORY  QUOTIENT  CHANGES  WITH 
THE  BLOOD  SUGAR  AND  CAN  BE  REGARDED  AS  AN  INDIRECT 
MEASURE  OF  EFFICIENCY.  WITH  FIVE  MEALS  THE  VALUES 
DO  NOT  DECREASE  AS  MUCH  AS  WITH  TWO  MEALS,  SO  THAT 
EFFICIENCY  REMAINS  HIGHER  THROUGHOUT  THE  WORKING 
DAY.  [Haggard  &  Greenberg] 
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Traditional  distribution  of  working  time 
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Shortened  mid-day  break  with  in-between  meals 
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3.  WORKING  TIME,  FOOD  INTAKE  AND  READINESS  TO  PERFORM. 
DURING  WORKING  HOURS  THE  FOOD  INTAKE  SHOULD  CONSIST 
OF  TWO  SNACKS  AND  A  SMALL  MID-DAY  MEAL  FOR  REASONS 
OF  HEALTH  AND  WORKING  EFFICIENCY.  [Grand jean] 


over  4000  problems  before  she  or  he  would  return  to 
the  same  problem.  A  LC  due  to  same  problems  can 
consequently  be  ruled  out.  The  existence  of  a  LC 
in  as  far  as  finding  a  method  to  mark  the  answers  in 
the  best  way  was  intended  to  also  be  ruled  out  - 
"subjects  were  given  practice"  [Klein]  -  the  data, 
however,  indicate  a  general  performance  increase 
(increase  in  number  of  problems  solved)  for  the 
first  few  test  intervals. 
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Last  not  least  it  must  be  understood  that  subjects  will 
introduce  their  own  bias  into  an  experiment.  Any  effort  which 
requires  subjects  to  react  in  two  ways  that  determine  one 
outcome  which  is  then  measured:  In  Klein's  experiment  the  Ss 
were  asked  "to  make  their  same/different  choices  as  quickly 
as  possible  without  making  errors."  [Klein,  1979]  This  clearly 
burdens  them  with  the  responsibility  to  decide  for  themselves 
what  is  meant  by  "as  quickly  as  possible"  when  weighed  against 
"without  making  errors."  The  fact  that  they  did  make  some 
errors  indicates  that  they  really  made  their  same/different 
choices  more  quickly  than  possible  without  making  errors.  It 
needs  to  be  examined  which  factor  caused  them  to  "accept"  an 
error  rate  of  1.6.  Had  more  emphasis  been  put  on  accuracy  a 
zero  error  rate  could  have  resulted,  more  stress  on  speed 
could  have  caused  a  higher  error  rate.  This  difference  in 
emphasis  can  be  looked  for  in  the  experimental  instructions , 
in  the  way  the  instructions  were  given  and  possibly  explained, 
but  especially  in  the  Ss '  own  understanding  of  the  instructions, 
anticipation  of  a  desired  result  and  a  personal  weighting  of 
the  respective  importance.  This  individually  different  trade¬ 
off  of  speed  versus  accuracy  influences  the  outcome  and  needs 
to  be  considered. 


II.  STATISTICAL  ANALYSIS 


A.  INTRODUCTION 

The  results  of  the  analysis  performed  by  the  experimenters 
[Klein  and  Armitage]  are  as  follows: 

1.  There  is  an  ultradian  rhythm  of  approximately  90 
minutes  in  the  performance  data. 

2 .  Verbal  and  spatial  scores  tend  to  have  rhythms  at 
this  frequency  which  are  180°  out  of  phase,  i.e., 
verbal  performance  peaks  when  spatial  performance  is 
at  its  lowest  level. 

The  statistical  analysis  given  in  this  thesis  and  which  is 
presented  in  the  next  sections  comes  to  some  distinctly 
different  conclusions. 

1.  Most  importantly  the  behaviour  of  the  subjects  is 
definitely  inhomogeneous  over  the  set  of  subjects. 

It  will  be  shown  that  there  are  two  sets  of  subjects 
which,  within  groups,  show  homogeneous  behaviour  in 
space  (ensemble)  and  time,  but  between  groups  show 
disparate  behaviour  in  frequency  and  time. 

2.  The  only  evidence  for  non-stationary  behaviour  in 
time  in  the  two  subgroups  appears  to  be  a  four  hour 
cycle,  believed  to  be  caused  by  a  lunch-time  effect. 


3.  There  is  very  little  evidence  for  a  strong  out-of¬ 
phase  link  between  spatial  and  verbal  behaviour. 


The  analysis  is  complicated  by  the  fact  that  the  usual 
assumptions  of  homogeneity  in  space  and  time  are  not  valid, 
and  that  we  are  looking  at  eight  bivariate  (verbal-spatial) 
processes.  Clearly  there  will  be  interactions  between  the 
inhomogeneities.  The  inhomogeneity  between  Ss  will  be 
presented  first  since  it  dominates  the  inhomogeneity  in  time. 

It  should  be  noted  that  the  data  set  is  not  large  enough 
to  make  very  fine  distinctions  in  the  underlying  behaviour 
mechanisms.  Thus  the  conclusions  beyond  those  of  subject 
inhomogeneity  are  tentative. 

B.  EXPERIMENTER'S  ANALYSIS 

The  experimenter  converted  the  32  scores  of  each  S  and 
each  task  to  Z-scores  through  the  process  of  normalization 
and  performed  a  Fourier  analysis .  Power  spectral  peaks 
appeared  at  4  hours,  96  minutes  and  at  37  minutes.  Observed 
peaks  were  significantly  larger  than  one  would  expect  by 
chance  (P  .01)  when  tested  through  the  use  of  two  methods: 

In  a  "rank  test"  method  the  power  spectra  of  each  subject 
were  replaced  by  their  ranks  and  then  ordered  increasing 
from  1  to  16.  A  mean  rank  across  eight  Ss  for  each  period 
was  computed.  Mean  rank  above  12.3  were  found  to  be  larger 
than  expected  by  chance  at  the  .01  level  in  a  one-tailed 
test. 

In  a  "randomization  test"  500  random  orderings  of  each 
S's  scores  for  each  task  were  subjected  to  Fourier  analysis. 


Using  the  mean  root  power  of  the  random  sequences  for  each 
subject,  task  and  period  the  probability  was  determined  of 
obtaining,  by  chance,  a  spectral  value  (root  power)  as  large 
or  larger  than  actually  observed.  Combining  the  probabilities 
across  Ss  it  was  determined  for  the  group  which  periods  had 
consistently  larger  peaks  than  would  be  expected  by  chance 
at  the  .01  level. 

C.  CRITIQUE  OF  EXPERIMENTER'S  ANALYSIS 

The  scores  are  actually  quite  complex,  since  they  are 
the  number  of  correctly  performed  tasks  in  the  three  minute 
interval.  No  information  is  available  on  the  actual  number 
of  tasks  attempted.  Thus  the  scores  measure  a  mixture  of 
the  Ss's  speed  and  error-proneness . 

Standardizing  by  converting  to  Z-scores  is  also  open  to 
question.  This  standardization  assumes  normality.  In  fact 
the  scores  are  quasi-count  data,  so  that  the  mean  and  variance 
of  the  scores  will  not  be  independent.  A  square-root  trans¬ 
formation,  both  to  standardize  the  variance  and  induce 
normality  would  have  been  preferable.  An  alternative 
preferred  in  most  of  the  analysis  used  in  this  thesis  is  to 
replace  the  data  by  the  ranks  within  Ss.  As  will  be  seen  in 
the  two-way  layout  given  below,  this  immediately  shows  up 
an  inhomogeneity  in  the  Ss's  behaviour  over  time. 

The  spectral  analysis  is  also  subject  to  criticism.  The 
technique  is  valid  to  test  that  the  spectrum  is  not  flat, 
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with  the  proviso  that  the  maximum  of  the  mean  ranks  must  be 
used  as  the  test  statistic  and  the  subjects  must  be 
homogeneous.  Alternate  tests  for  each  spectrum  are  given 
below.  However,  rejection  of  flatness  of  the  spectrum  could 
indicate  either  that  serial  correlation  is  present  in  the 
data  (very  likely)  or  that  there  is  a  cycle,  or  both.  The 
test  results  given  below  indicate  very  different  results  when 
the  two  subgroups  of  subjects  are  analysed  separately. 

D.  PLOT  OF  RAW  DATA  (PERFORMANCE  SCORES) 

The  analysis  was  begun  by  plotting  the  raw  data.  In  this 
procedure,  the  spatial  scores  were  marked  along  the  horizontal 
axis,  the  verbal  scores  along  the  vertical  axis.  It  was  the 
goal  of  this  plotting,  which  is  shown  in  Figure  4,  to  display 
any  correlation  between  the  two  elements  of  the  paired  test 
scores . 

The  Pearson  product-moment  correlation  coefficient  con¬ 
firmed  the  visually  apparent  slightly  negative  correlation 
between  the  test  scores  of  each  of  the  eight  subjects,  as 
tabulated  in  Table  I . 

The  approximate  standard  deviation  for  the  coefficient 

—  L  _J_ 

in  normal  samples  is  n  =32  =0.177.  Thus  since  we  are 

looking  at  8  of  these  values,  the  maximum  value  of  -.48  is 
probably  not  significant.  There  is,  however,  a  slight 
indication  of  negative  correlation  since  all  the  correlations 
are  negative.  This  question  will  be  returned  to  later  after 
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TABLE  I.  CORRELATION  COEFFICIENT  OF  VERBAL/ SPATIAL  PERFORM' 
ANCE  SCORES. 


E.  FRIEDMAN'S  TWO-WAY  ANALYSIS  OF  VARIANCE 

In  the  following  section  a  nonparametric  analog  to  the 
two-way  analysis  of  variance  procedure  is  performed.  The 
data  are  presented  in  a  kXn  two-way  table  where  k  is  the 
number  of  subjects  and  n  is  the  number  of  trials  (ranks) . 

The  subjects  cannot  -  as  the  analysis  will  show  -  be 
considered  a  single,  homogeneous  random  sample  because  of 
certain  relationships  between  them.  The  rows  in  Tables  II 
and  III  indicate  subjects,  the  columns  are  ranks,  i.e.  the 
scores  have  been  replaced  by  ranks  within  subjects.  While 
the  row  totals  are  constant  (sum  of  ranks)  the  column  totals 
are  only  expected  to  be  the  same  if  there  is  no  difference 
in  the  observations  that  lead  to  the  ranks.  Since  the  sum 
of  deviations  of  observed  column  totals  from  expected  column 
totals  k(n+l)/2  is  zero,  the  sum  of  squares  of  the  deviations 
will  reveal  differences  in  observations,  i.e.  homogeneity  or 
lack  of  homogeneity  in  the  population  of  eight  subjects.  The 
first  thing  which  is  evident  from  this  table  is  that  the 
behaviour  of  the  ranks  classifies  them  into  two  groups.  Thus 
note  that  at  time  period  7  subjects  1,  2,  3,  8  have  low 
spatial  ranks,  while  subjects  4,  5,  6,  7  have  high  ranks. 
Similar  opposite  behaviour  is  seen  at  other  time  periods, 
e.g.  12,  16,  19,  23,  30.  The  same  effect  can  be  seen  from 
the  verbal  ranks  (Table  III) ;  in  fact,  it  shows  up  from  the 
verbal  ranks  in  most  cases  in  the  same  time  period  as  it 
does  from  the  spatial  ranks. 


TABLE  II.  TWO-WAY  LAYOUT  OF  SUBJECTS  AND  THE  RANKS  OF  SPATIAL  TEST  SCORES  WITHIN  EACH 
SUBJECT.  WITHIN  LIMITS  THERE  SHOULD  BE  SOME  COHERENCE  BETWEEN  SUBJECTS. 
HOWEVER,  ONE  COULD  HAVE  CYCLIC  PHENOMENA  WITH  (RELATIVELY)  RANDOM  PHASES 
AND  THIS  WOULD  DECREASE  COHERENCE  OR  CONCORDANCE  BETWEEN  SUBJECTS. 
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EACH  SUBJECT. 


This  observation  is  crucial.  We  will  now  show  that  for 


overall  eight  subjects  the  Friedman  statistic  shows  no  common 
effect,  but  that  this  changes  drastically  if  the  statistic 
(or  the  equivalent  Kendall's  coefficient  of  concordance)  is 
applied  within  subgroups. 

The  random  variable  S,  the  test  statistic  for  the  Friedman 
two-way  analysis  of  variance,  under  the  hypothesis  of  no 
difference  between  observations  (ranks)  therefore  has  a 
sampling  distribution  as  follows: 


S 


k(n+l) 

2 - 


2 


Friedman's  statistic  F  is  a  linear  function  of  this  random 
variable  and  is  defined  as: 


=  12  s_  =  12 
kn (n+1)  kn (n+1) 


n 

R?  -  3k  (n+1) 

j-1 


It  can  be  treated  as  a  chi-square  variable  with  n-1  degrees 
of  freedom  as  long  as  n  is  greater  than  7.  The  results  are 
given  in  Table  IV.  The  rejection  region  for  the  test  of  the 
hypothesis  of  no  difference  in  treatments  (ranks)  with  n  =  32 
consists  of  all  values  greater  than  45  or  a  =  .05  and  of 
all  values  greater  than  52.2  for  a  =  .01.  This  means  -  since 
all  F-values  of  Table  IV  definitely  fall  into  this  region  - 
rejection  of  the  hypothesis  of  no  difference  at  the  .05  level. 
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At  the  .01  level  the  overall  F-statistic  for  the  spatial 
task  falls  just  below  the  rejection  region. 


overa 1 l 

group  1238 

group  4567 

verbal 

sum  R? 

618448 

167280 

171418 

F  stat 

86.47 

79.23 

90.98 

spatial 

sum  Rj 

592805 

172177 

170624 

F  stat 

50.05 

93.14 

88.73 

TABLE  IV.  LISTING  OF  THE  FRIEDMAN  STATISTIC  FOR  THE  TEST  OF 
THE  HYPOTHESIS  OF  NO  DIFFERENCE  BETWEEN  TREATMENTS 
(INDEPENDENCE  OF  SUBJECTS) . 

F.  KENDALL'S  COEFFICIENT  OF  CONCORDANCE 

The  above  mentioned  statistic  F  provides  a  possibilitiy 
to  test  the  hypothesis  of  no  difference  between  "treatments" 

-  that  results  in  ranks  -  at  any  level  a.  Actually  of  more 
importance  is  homogeneity  or  concordance  between  subjects. 

A  method  to  describe  the  concordance  between  subjects 
and  to  find  a  measure  of  relationship  between  rankings  is 
the  computation  of  Kendall's  coefficient  of  concordance,  W. 
This  coefficient  measures  agreement  between  subjects, 
concordance  between  rankings  or  dependence  of  sample.  It  is 
analogous  to  the  two-sample  measurement  of  concordance; 
however,  it  does  not  range  from  -1  to  1,  but  only  from  0  to  1 . 
The  reason  for  this  is  that  among  more  than  two  samples 
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perfect  discordance  (as  -1  indicates  it  in  the  two-sample 
case)  cannot  be  defined.  Zero  means  no  agreement  between 
subjects  or  independence  of  samples,  1  indicates  perfect 
concordance.  The  statistic  W  is  defined  to  be  the  ratio  of 
S  (as  described  in  the  previous  section)  to  its  maximum 
possible  value  k  [Gibbons,  1971,  pg.  251]. 


lst 


,2  2  ,, 

k  nvn  -1) 

12 


W  = 


IT". 

st 


12  S 


2  2 
k  n  (n  -1) 


As  can  be  seen  from  Table  V  the  complete  group  of  eight 
subjects  is  rather  inhomogeneous,  while  the  two  previously 
identified  subgroups  1,  2,  3,  8  and  4,  5,  6,  7  are  much  more 
homogeneous  within  groups.  Table  V  also  contains  the 
necessary  conversion  of  the  statistic  W  to  a  statistic  Y, 
a  test  statistic  that  allows  a  significance  test  at  level  a. 


overal 1 

group  1238 

group  4567 

verbal 

S 

55072 

27888 

32026 

W 

.31543 

.63893 

.73373 

Y 

78.23 

79.23 

90.98 

spatial 

S 

33917 

32389 

30968 

W 

.19426 

.74205 

.70949 

Y 

48.18 

92.01 

87.98 

TABLE  V.  KENDALL'S  COEFFICIENT  OF  CONCORDANCE  AND  THE  TEST 
STATISTIC  Y  FOR  SIGNIFICANCE  TESTS. 

Y  =  k-  (n-1)  *W  Y  =  k  *  31 • W 


p 


m 
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This  statistic  is  distributed  chi-square  with  n-1  degrees 
of  freedom.  In  a  test  of  the  hypothesis  of  independence 
between  subjects  large  values  of  Y  should  fall  in  the 
rejection  region.  For  n  =  32  and  a  equal  .05,  values  greater 
than  45,  for  a  =  .01  values  greater  than  52.2  have  to  be 
considered  large.  The  result  shown  in  Table  V  is  therefore 
the  same  as  that  of  Friedman's  two-way  layout  discussed 
above . 

G.  FOURIER  ANALYSIS  OF  RAW  DATA  (SASE  VI) 

The  raw  data  without  having  been  normalized  or  smoothed 
previously  were  subjected  to  Fourier  analysis  using  the 
available  SASE  VI  -  program  [LEWIS ,1976  : . 

The  intent  of  this  step  of  the  analysis  was  to  see  if 
the  power  spectra  (or  amplitude)  for  the  verbal  and  spatial 
performance  scores  as  shown  by  Klein  could  be  duplicated. 
Further  it  was  desired  to  see  if  there  were  effectively  any 
peaks  in  the  spectra  at  any  frequencies,  and  to  compare  the 
verbal  and  spatial  spectra.  The  computation  is  as  follows: 
let  the  set  of  data  be  indicated  by  X^,  i  =  1,  2,  .  .  .,  n. 
The  Finite  Fourier  transform  of  this  data  is 

n 

y  -i2nu-l)  •  j/n 
*1  e 

1=1 

7  j  =  0,  1,  .  •  • ,  n— 1 


40 


The  periodogram,  an  estimated  amplitude 


where  i  =  • 

spectrum  of  the  data  is  defined  to  be 


P(j)  = 


I  a  ( j  )  ! 
2!invar  (XI 


■  _  i  n  , 

?  1  1/  •  •  •#^■*“1 

This  estimates  the  spectral  density  of  the  sequence  XQ 

l 

given  by 

00 

f+('jj)  =  ^  1+2  p.  COS(joj)  , 

j=l 

where  the  serial  correlations  are  defined  to  be 

Qj  =  corr (^£X^+ j )  j  ®  Of  il,  i2,  ... 

We  note  that  f+(w)  =  i-  if  the  data  are  independent.  More 
important  are  the  sampling  properties  of  the  periodogram 
points  P(jj*  These  are  approximately  independent  and 
exponentially  distributed  with  mean  value  f+(oj).  In 
particular  if  the  spectrum  is  flat  f+(w)  =  and  the 
periodogram  points  are  approximately  i.i.d.  exponential. 

Before  doing  any  formal  tests  for  departure  from  a  flat 
spectrum,  it  is  worth  looking  at  the  spectra  graphically. 

The  spectra  are  smoothed  by  averaging  over  the  eight  subjects 
at  each  frequency  f ^  (or  j),  as  was  done  by  Klein.  The 
numbers  are  given  in  Tables  VI  and  VII,  and  the  graphs  in 
Figures  5  and  6.  Klein's  spectra  are  essentially  reproduced 


and  the  dominant  feature  in  both  is  the  peak  at  a  frequency 
corresponding  to  a  period  of  four  hours.  If  this  peak  were 
significant,  it  would  correspond  to  the  lunch  time  effect 
noted  in  the  introduction. 

However,  in  the  previous  section  we  have  seen  that  there 
is  an  inhomogeneity  in  the  subjects.  Therefore  smoothed 
periodograms  for  the  subjects  1,  2,  3,  8  and  4,  5,  6,  7  are 
shown  for  both  verbal  and  spatial  tests  (Figures  5  and  6) . 

The  most  interesting  aspect  of  this  split  is  that  the  strong 
similarity  between  the  overall  verbal  and  spatial  periodograms 
disappears.  In  the  spatial  case  (Figure  5)  the  dominant  4 
hour  peak  is  still  apparent.  However,  the  verbal  periodogram 
for  subjects  4,  5,  6,  7  does  not  have  this  peak.  In  fact, 
the  verbal  periodograms  for  the  two  groups  are  dissimilar, 
reinforcing  the  conclusion  of  heterogeneity  in  the  sample  of 
eight  subjects. 

Before  any  other  inferences  can  be  drawn  from  the 
periodograms,  it  is  necessary  to  check  the  significance  of 
the  departure  from  a  flat  spectrum,  since  the  peaks  in  the 
spectra  may  be  due  simply  to  sampling  fluctuations.  This  is 
done  in  the  next  section. 
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TABLE  VI.  THE  PERIODOGRAM  VALUES  FOR  EACH  SUBJECT  AT  EACH  FREQUENCY  f -=j/n; 

j=lf  2,  .  .  . , (n/2 ) -1  FOR  THE  SPATIAL  SCORES.  THE  AVERAGE  J 

(SMOOTHED)  PERIODOGRAM  IS  GIVEN  FOR  EACH  OF  TWO  SUBGROUPS  AND  FOR  THE 
COMPLETE  SET  OF  EIGHT  SUBJECTS.  THESE  VALUES  ARE  GRAPHED  IN  FIGURE  5 


a) 


i 


1.0 

.9 

.8 

.7 

.6 

.5 

P(j)  -4 
.3 
.2 
.1 


1  2  3  4  5  6  7  8  910111213141516 

j  =  nfj 


FIGURE  5.  GRAPH  OF  SMOOTHED  PERIODOGRAMS  FOR  SPATIAL  SCORES 
OF  THE  TWO  SUBGROUPS  (a  &  b)  AND  FOR  ALL  SUBJECTS 
(c) .  IN  ALL  CASES  THE  4  HOUR  PERIOD  COMPONENT 
( j=2 )  IS  THE  MAXIMUM  PEAK,  BUT  IN  THE  CASE  OF 
SUBJECTS  1238  THE  96-MINUTE  PEAK  IS  NOT  LARGE. 


TABLE  VII.  THE  PERIODOGRAM  VALUES  FOP  EACH  SUBJECT  AT  EACH  FREQUENCY  f.=j/n; 

j=l,  2,  .  .  .  ,  (n/2) -1  FOR  THE  VERBAL  SCORES.  THE  AVERAGE  3 

(SMOOTHED)  PERIODOGRAM  IS  GIVEN  FOR  EACH  OF  TWO  SUBGROUPS  AND 
FOR  THE  COMPLETE  SET  OF  EIGHT  SUBJECTS.  THESE  VALUES  ARE  GRAPHED 
IN  FIGURE  6 . 


j-nf  j 


FIGURE  6.  GRAPH  OF  SMOOTHED  PERIODOGRAMS  FOR  VERBAL  SCORES 
OF  THE  TWO  SUBGROUPS  (a  &  b)  AND  FOR  ALL  SUBJECTS 
(c) .  THE  DIFFERENCE  IN  THE  SPECTRA  OF  THE  TWO 
SUBGROUPS  IS  STRIKING. 
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H.  TEST  FOR  FLAT  SPECTRUM 

The  pooled  Fourier  analysis  performed  by  the  experiment¬ 
ers  [Klein,  1979]  yielded  a  power  spectrum  in  which  dominant 
peaks  occured  at  4  hours,  at  96  minutes  and  at  37  minutes. 

A  Kolmogorov-Smirnov  test  for  flat  spectrum  was  applied  to 
test  the  significance  of  all  peaks  together.  This  tests  the 
indepe  dnc^  of  the  32  data  points  which  go  into  making  the 
spectra.  _'he  idea  behind  this  is  that  since  the  p(j)’s  are 
approximately  i.i.d.  and  exponential,  then 

j 

p  (i) 

i=l 

"  “n  " - 

E 

i=l 

—  i  n  n 

/  3  =  -*-,  .  .  •  » 2" —  - 

are  order  statistics  from  a  uniform  (0,  1)  distribution 
under  the  null  hypothesis  of  a  flat  spectrum.  [Cox  &  Lewis, 
1966]  But  this  is  the  canonical  form  of  the  usual  tests  of 
goodness-of-fit  and  thus  the  Kolmogorov-Smirnov  and  Anderson- 
Darling  tests  have  been  applied.  [Cox  &  Lewis,  1966]  The 
results  are  shown  in  Table  VIII.  For  the  verbal  spectra  it 
is  striking  that  for  subjects  1,  2,  3,  8  the  K-S  test  is 
highly  significant,  while  it  is  not  significant  for  subjects 
4,  5,  6,  7  at  any  level.  Therefore  the  conclusion  might  be 
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that  as  far  as  verbal  tests  are  concerned  there  is  a  strong 
lunch  time  effect  in  one  case  and  not  in  the  other.  There 
is  a  much  smaller  chance  that  spatial  tests  were  affected  by 
the  lunch  time  effect  for  the  group  4,  5,  6,  7.  This  would 
confirm  some  opposition  for  verbal  and  spatial  tests,  with 
the  qualification  that  it  is  a  subject  dependent  result. 

It  would  be  difficult  to  extract  from  any  of  these  spectra, 
jointly  or  spearately,  any  indication  of  a  90-min  cycle  in 
the  data.  The  90-min  peak  is  big  when  the  4  hour  peak  is 


big,  indicating  that  one  sees  a  harmonic  of  the  long  cycle. 


Subject 

i 

2 

3 

4 

5 

6 

7 

8 

spatial 

KS 

1.008 

0.860 

0.741 

0.908 

1.303 

two-sided  test 

p* .80  KS3 1 .032 

■ 

■ 

f 

nf 

nf 

f 

nf 

p-  .90 

KS31 . 18 

H 

B 

f 

nf 

nf 

f 

nf 

p- .  95 

KS»1.308 

B 

f 

nf 

f 

f 

nf 

p«.98 

KS'1.464 

B 

B 

f 

f 

f 

f 

f 

p* .  99 

KS«1.568 

B 

■ 

f 

f 

f 

f 

f 

verbal 

KS 

1.270 

0.786 

0.516 

0.936 

1.760 

two-sided  test 

p* . 80  KS-1.032 

nf 

1 

H 

■ 

1 

■ 

■ 

nf 

P-.90 

KS-1.18 

nf 

B 

B 

B 

nf 

p*  .95 

KS-1.308 

f 

n 

B 

nf 

pm .  98 

KS31 .464 

f 

B 

Sjj 

nf 

p«  .99 

KS* 1 . 568 

f 

■ 

B 

B 

B 

B 

nf 

f»flat  nf»non-flat 


TABLE  VIII.  KOLMOGROV-SMIRNOV  TEST  RESULTS  FOR  DIFFERENT 
p-LEVELS.  FOR  KS-VALUES  HIGHER  THAN  THE  ONES 
ASSOCIATED  WITH  THE  p-LEVEL  ON  THE  LEFT,  THE 
HYPOTHESIS  OF  FLATNESS  IS  REJECTED.  AT  THE  .05 
LEVEL  THIS  IS  ONLY  THE  CASE  FOR  SUBJECTS  3  AND 
8  IN  THE  VERBAL  SCORES. 
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I .  KENDALL ' S  TAU 

The  previous  sections  have  indicated  some  tendency  of  the 
performance  in  verbal  and  spatial  tasks  -  as  shown  by  the 
scores  -  to  be  negatively  correlated  and  perhaps  out  of 
phase  at  some  frequency.  We  will  pursue  this  further  here. 
Klein's  hypothesis  is  that  a  cycle  of  about  96  minutes  exists 
in  the  performance  of  a  human  and  chat  verbal  and  spatial 
performance  are  about  180°  out  of  phase,  i.e.  good  verbal 
performance  goes  together  with  less  good  spatial  performance 
and  vice  versa. 

To  test  the  second  part  of  the  hypothesis  -  the  out-of¬ 
phase  phenomenon  -  assuming  for  the  time  being  that  a  cycle 
does  exist,  Kendall's  tau  coefficient  was  computed  which  is 
a  measure  of  association  between  random  variables  from  any 
bivariate  population. 

The  reasoning  behind  using  Kendall's  tau  as  opposed  to 
the  Pearson  product-moment  correlation  coefficient  (as  in 
paragraph  D  of  this  section)  at  this  point  is  that  tau  is 
invariant  under  all  transformations  of  X  and  Y  for  which 
the  order  of  magnitude  is  preserved;  this  allows  the  use 
of  ranks  instead  of  raw  data. 

Perfect  direct  and  indirect  association  between  X  and  Y 
are  reflected  by  perfect  concordance  and  discordance, 
respectively.  Concordance  or  agreement  is  a  comparative 
type  of  association  and  means  that  large  (small)  values  of 
X  are  associated  with  large  (small)  values  of  Y.  [Gibbons, 
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1971,  pg.  206]  Discordance,  the  opposite,  associates  large 
X-values  with  small  Y-values  and  vice  versa. 


The  Null  hypothesis  in  this  test  is  that  the  and 
are  independent.  The  procedure  to  compute  Kendall's  tau, 
also  called  Theil's  statistic,  is  as  follows:  Order  the 
X,  Y-pairs  such  that  the  X-values  are  in  increasing  order 
(this  is  shown  in  Table  IX) ;  the  ranks  of  the  X^  then  run 
from  1  to  n  (1  to  32  in  the  experimental  data  under 
discussion) .  Now  define  P  as  the  number  of  times  the  rank 
of  a  Y-value  is  greater  than  the  rank  of  any  of  the  following 
Y-values.  Define  Q  as  the  number  of  times  the  rank  of  a 
Y-value  is  smaller  than  the  rank  of  any  of  the  following 
Y-values.  P  and  Q  then  have  to  add  to  the  total  number  of 
comparisons . 


P  =  #  times 

Q  -  #  times 


j  =  i+1,  i+2 


P+Q  = 


r(Yi) 

r(Yi) 


>  r(Y j ) 

>  r(Yj) 
.n 


Now  define  K  to  be  the  difference  P-Q.  K  is  the  difference 
of  two  random  variables  and  is  distribution-free.  K  is 
symmetric  with  a  range  of (-n (n-1) ) /2  to  (n(n-l))/2  and  an 
expected  value  E (K)  =  0;  the  variance,  sigma  squared,  equals 
(n(n-l)  (2n+5))/18  under  the  Null  hypothesis  that  X^_  and  Y^ 
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are  independent.  The  reasoning  for  this  is  the  permutation 
hypothesis  which  states  that  under  assumption  of  independent 
identical  distribution  each  of  N!  permutations  of  the  ranks 
is  equally  likely.  Define  T  to  be  K  divided  by  (n(n-l)),/2. 

For  large  n  the  random  variable  Z  can  be  treated  as  a  standard 
normal  variable.  It  is  a  linear  function  of  T  and  is  defined 
as : 


z  _  3  /m (m-lj  .  T 
/2T2m+5l 

The  results  of  the  test  of  the  hypothesis  of  independence  of 
the  and  is  shown  in  Table  X.  Seven  out  of  the  eight 
subject  results  are  not  significant  at  the  .05  level,  not 
even  at  the  .10  level.  The  hypothesis  of  independence  is 
therefore  accepted.  This  might  be  confounded  by  the 
frequency  effect.  Thus  one  could  look  at  lagged  correlations. 
This  is  pursued  below  in  paragraph  L  of  this  section. 

J.  SPEARMAN'S  COEFFICIENT  R  OF  RANK  CORRELATIONS 

Another  computation  performed  to  be  able  to  test  the 
hypothesis  of  independence  between  the  two  different  kinds 
of  performance  scores  was  finding  Spearman's  coefficient  of 
rank  correlation.  The  proceure  is  as  follows:  The  X  observa¬ 
tions  are  replaced  by  their  rank  within  the  X^-group;  the 
Y  observations  are  replace  by  their  ranks  in  the  Y^-group. 

The  data  then  consist  of  n  sets  of  paired  ranks  and  the 
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TAILE  X.  LISTING  OF  P  IN  KENDALL'S  TAU  COMPUTATION  AND 
STATISTIC  L=K/SIGMA(K)  FOR  SIGNIFICANCE  TEST. 
ONLY  SUBJECT  NO.  2  SHOWS  SIGNIFICANT  SIGNS  OF 
DEPENDENCE;  OVERALL  INDEPENDENCE  IS  ACCEPTED. 
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coefficient  R  can  be  computed.  Define  to  equal  rank(X^) 
and  define  to  equal  rank(Y^);  define  to  equal  the 
difference  of  R^  and  S^.  The  coefficient  R  is  then  defined 
to  be : 

6-Z  D2 

R  =  1  -  — - 

m (n  -1) 


” R  measures  the  degree  of  correspondence  between  rankings, 
instead  of  between  actual  variate  values,  but  it  can  still 
be  considered  a  measure  of  association  between  the  samples 
and  an  estimate  of  the  association  between  X  and  Y  in  the 
continuous  bivariate  population."  [Gibbons,  1971,  pg.  226] 

As  displayed  in  Table  XI,  with  the  exception  of  subject 
number  2,  none  of  the  subjects  shows  significant  rank 
correlation  to  reject  the  hypothesis  of  independence  between 


the  verbal  and  spatial 

scores . 

R1  : 

-.19098 

Z  : 

-1.06 

Z=R (n-1)  ** 

R2  : 

-.42223 

Z  : 

-2.35 

R3  : 

-.06745 

Z  : 

-0.38 

mean  of  R 

: - . 1504 

R4  : 

-.04857 

Z  : 

-0.27 

std.  dev. 

:  .1498 

R5  : 

-.28556 

Z  : 

-1.59 

R6  : 

-.18054 

Z  : 

-1.01 

R7  : 

+.028776 

Z  : 

+0.16 

R8  : 

-.03629 

Z  : 

-0.20 

TABLE  XI.  SPEARMAN'S  COEFFICIENT  OF  RANK  CORRELATION  AND  CONVERSION  TO 
STANDARD  NORMAL  RANDOM  VARIABLE. 

WITH  THE  EXCEPTION  OF  SUBJECT  NO.  2  NO  SUBJECT  SHOWS  SIGNIFI¬ 
CANT  RANK  CORRELATION  TO  REJECT  THE  HYPOTHESIS  OF  INDEPENDENCE 
BETWEEN  THE  VERBAL  AND  SPATIAL  PERFORMANCE  SCORES. 


K.  RANK  SUM  MINUS  EXPECTED  RANK  SUM  UNDER  Hq 

Another  way  pursued  to  examine  the  hypothesized  out-of¬ 
phase  phenomenon  was  a  plot  of  the  sums  of  paired  ranks  minus 
the  expected  rank  sum  under  H  ,  n+1  =  33,  vs.  time  period. 
Since  under  the  hypothesis  of  Klein  the  performance  of  verbal 
and  spatial  tasks  should  be  180°  out  of  phase.  This  would 
mean  good  performance  (i.e.  high  ranks)  in  one  task  should 
go  together  with  poor  performance  (i.e.  low  ranks)  in  the 
other.  The  expected  value  of  this  statistic  is  zero  under 
Klein's  hypothesis. 

The  procedure  to  develop  this  statistic  is  as  follows: 
Replace  the  raw  data  by  their  ranks  in  each  group  of  X  and  Y. 
Add  the  ranks  of  each  pair  X  and  Y  and  subtract  33,  which  is 
n+1  for  the  data  under  discussion.  If  the  performance  in  the 
two  different  tasks  really  is  out  of  phase  the  resulting 
values  should  be  around  zero.  Very  negative  (or  positive) 
results  indicate  an  overall  deviation  to  poor  (or  good) 
performance.  Negative  values  can  be  expected  in  the  begin¬ 
ning  of  the  testing  session  due  to  a  learning  curve  effect 
(LC)  as  well  as  later  in  the  experiment  due  to  fatigue. 

Under  Klein's  composite  hypothesis  of  a  96  minute  cycle 
and  an  out-of-phase  phenomenon  for  the  performance  on  the 
two  tasks  one  would  expect  this  measurement  to  ^  te 

around  zero  following  the  96  minute  cycle.  The  result  as 
displayed  in  Figure  7  shows,  however,  that  -  if  anything 
at  all  -  it  follows  a  slow  wave  movement  which  is  presumed 
to  be  induced  by  the  lunch  time  effect. 


w 
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FIGURE  7.  PLOT  OF  THE  RANK  SUMS  OF  BOTH  VERBAL  AND  SPATIAL 

RANKS  AT  EACH  INTERVAL  MINUS  THE  EXPECTED  RANK  SUM 
UNDER  KLEIN'S  HYPOTHESIS  OF  OUT-OF-PHASENESS . 

LOW  VALUES  CAN  BE  EXPECTED  AT  BOTH  ENDS  OF  THE 
GRAPH;  DUE  TO  A  LEARNING  CURVE  IN  THE  BEGINNING 
AND  DUE  TO  FATIGUE  AT  THE  END  OF  THE  EXPERIMENT. 
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L.  LAG  SHIFTING  OF  RANKS 

To  investigate  the  out-of-phase  hypothesis  further,  an 
analog  to  an  auto-correlation  procedure  was  devised.  After 
replacing  the  raw  scores  by  their  ranks,  one  column  of 
performance  socre  ranks  was  kept  stationary  and  the  other 
was  shifted  ("lagged")  downwards  for  negative  lag,  upwards 
for  positive;  i.e.  in  this  procedure  spatial  score  (rank) 
position  1  was  paired  with  verbal  rank  position  2  for 
positive  lag  1;  spatial  position  1  was  paired  with  verbal  3 
for  positive  lag  2,  etc...  .  For  each  lag  in  the  range  from 
0  to  ±15  the  simple  Pearson  product-moment  correlation  was 
computed  and  plotted  as  shown  in  Figure  8. 

The  reasoning  for  this  procedure  is  the  following:  Under 
Klein's  hypothesis  of  out-of-phaseness  one  can  expect  a  high 
positive  correlation  of  two  performance  scores  after  about 
3-4  shifts  (lags) .  Since  the  scores  were  collected  every 
15  minutes,  3  to  4  shifts  correspond  to  45  to  60  minutes. 

With  a  hypothesized  cycle  period  of  80  to  110  minutes, 
measurements  that  were  initially  out  of  phase  would  then  be 
in  phase  and  should  be  highly  positively  correlated. 

The  results  for  all  subjects  are  similar  to  the  repre¬ 
sentative  plot  of  Figure  8,  where  a  cycle  of  the  hypothesized 
period  cannot  be  found.  The  symmetry  of  the  correlation  plot, 
however,  is  striking:  In  all  eight  subjects  the  highest 
positive  correlation  is  found  after  8  to  10  shifts,  indicating 
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a  period  of  4.5  to  5  hours.  It  could  also  -  which  is  probably 
more  realistic  -  simply  indicate  the  positive  correlation 
between  poor  scores  at  the  beginning,  after  about  4  hours  at 
lunch  time  and  again  4  hours  later  due  to  fatigue. 


FIGURE  8 .  PLOT  OF  THE  PEARSON  PRODUCT-MOMENT  CORRELATION  FOR 

UP  TO  15  SHIFTS  IN  POSITIVE  AND  NEGATIVE  DIRECTIONS, 
RESPECTIVELY.  THE  EXPECTED  STRONGLY  POSITIVE 
CORRELATIONS  AT  ABOUT  3-4  SHIFTS  CANNOT  BE  DETECTED. 
REPRESENTATIVE  PLOT;  HERE;  SUBJECT  NO.  3. 
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III.  CONCLUSION 


Considering  the  impact  and  importance  of  research  in  this 
part  of  cycle  research  as  described  in  Section  I,  paragraph 
F,  Klein  and  Armitage  certainly  investigated  an  interesting 
aspect  of  biological  rhythmicity.  Based  on  previous  sleep 
research  their  hypothesis  was  a  logical  one.  However,  based 
on  the  research  and  analysis  that  went  into  the  compositon  of 
this  thesis,  their  findings  cannot  be  agreed  to  wholeheartedly. 
The  inhomogeneity  in  the  small  group  of  their  subjects  seems 
to  be  a  fact  that  has  not  been  considered  before.  The  influ¬ 
ence  of  the  24-hour  cycle,  the  circadian  rhythm,  on  the  test 
procedures  and  their  timing  seems  sufficient  to  account  for 
any  rhythmicity  detected  by  Klein  and  Armitage. 

While  further  research  into  a  possible  ultradian  rhythm 
is  necessary  and  will  be  interesting,  it  needs  to  be  evalu¬ 
ated  if  the  impact  of  such  a  cycle  -  should  it  be  found 
conclusively  -  is  worth  the  effort  in  time  and  manpower.  It 
also  seems  as  if  a  useful  predictability  cannot  be  estab¬ 
lished  since  any  possible  influence  of  a  possible  ultradian 
rhythm  is  completely  overshadowed  by  the  variability  among 
subjects . 

It  is  suggested  that  a  future  experiment  make  use  of 
some  of  the  discussion  objects  of  section  I,  paragraph  I, 
that  were  not  obvious  to  Klein  and  Armitage’ s  experimental 


set-up. 
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